Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 99112 |
| Missing cells | 16251 |
| Missing cells (%) | 0.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 23.9 MiB |
| Average record size in memory | 253.3 B |
Variable types
| Categorical | 13 |
|---|---|
| DateTime | 2 |
| Numeric | 13 |
customer_id has a high cardinality: 99112 distinct values | High cardinality |
order_approved_at has a high cardinality: 90411 distinct values | High cardinality |
order_delivered_carrier_date has a high cardinality: 80749 distinct values | High cardinality |
order_delivered_customer_date has a high cardinality: 95394 distinct values | High cardinality |
order_estimated_delivery_date has a high cardinality: 423 distinct values | High cardinality |
product_most_frequent has a high cardinality: 31695 distinct values | High cardinality |
customer_unique_id has a high cardinality: 95780 distinct values | High cardinality |
customer_city has a high cardinality: 4116 distinct values | High cardinality |
product_category_name_english has a high cardinality: 72 distinct values | High cardinality |
payment_value is highly overall correlated with sum_price and 2 other fields | High correlation |
sum_price is highly overall correlated with payment_value and 1 other fields | High correlation |
sum_freight_value is highly overall correlated with payment_value | High correlation |
product_weight_g is highly overall correlated with payment_value and 4 other fields | High correlation |
product_length_cm is highly overall correlated with product_weight_g and 1 other fields | High correlation |
product_height_cm is highly overall correlated with product_weight_g | High correlation |
product_width_cm is highly overall correlated with product_weight_g and 1 other fields | High correlation |
order_status is highly imbalanced (91.5%) | Imbalance |
payment_type is highly imbalanced (61.1%) | Imbalance |
order_delivered_carrier_date has 1735 (1.8%) missing values | Missing |
order_delivered_customer_date has 2908 (2.9%) missing values | Missing |
customer_id is uniformly distributed | Uniform |
order_approved_at is uniformly distributed | Uniform |
order_delivered_carrier_date is uniformly distributed | Uniform |
order_delivered_customer_date is uniformly distributed | Uniform |
customer_unique_id is uniformly distributed | Uniform |
customer_id has unique values | Unique |
length_comment_title has 86798 (87.6%) zeros | Zeros |
length_comment_message has 57807 (58.3%) zeros | Zeros |
product_description_lenght has 1418 (1.4%) zeros | Zeros |
product_photos_qty has 1418 (1.4%) zeros | Zeros |
Reproduction
| Analysis started | 2023-02-10 09:55:27.080754 |
|---|---|
| Analysis finished | 2023-02-10 09:56:10.432704 |
| Duration | 43.35 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
customer_id
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 99112 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 9ef432eb6251297304e76186b10a928d | 1 |
|---|---|
| 30fe36e40e801f6f55cce8ee4aae9da3 | 1 |
| a4fe94a051d268fbbe8e4ca932ebc460 | 1 |
| ba712872211b52224c61d5bedfc1bfcf | 1 |
| f8b67d327058afa39382991d7173b1d7 | 1 |
| Other values (99107) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3171584 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 99112 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 9ef432eb6251297304e76186b10a928d |
|---|---|
| 2nd row | b0830fb4747a6c6d20dea0b8c802d7ef |
| 3rd row | 41ce2a54c0b03bf3443c3d931a367089 |
| 4th row | f88197465ea7920adcdbec7375364d82 |
| 5th row | 8ab97904e6daea8866dbdbc4fb7aad2c |
Common Values
| Value | Count | Frequency (%) |
| 9ef432eb6251297304e76186b10a928d | 1 | < 0.1% |
| 30fe36e40e801f6f55cce8ee4aae9da3 | 1 | < 0.1% |
| a4fe94a051d268fbbe8e4ca932ebc460 | 1 | < 0.1% |
| ba712872211b52224c61d5bedfc1bfcf | 1 | < 0.1% |
| f8b67d327058afa39382991d7173b1d7 | 1 | < 0.1% |
| 110b79f06a0f49a38da99084706a382d | 1 | < 0.1% |
| 366b4b63cda57be7ca46ecb33ee71f4e | 1 | < 0.1% |
| 52798469029a20d7814d2b58c0c63e0d | 1 | < 0.1% |
| 4ddbeaafc3eff2a014e49052df6c530f | 1 | < 0.1% |
| ab994eee6b515cbcf023c206bf29ec08 | 1 | < 0.1% |
| Other values (99102) | 99102 |
Length
| Value | Count | Frequency (%) |
| 9ef432eb6251297304e76186b10a928d | 1 | < 0.1% |
| 8b212b9525f9e74e85e37ed6df37693e | 1 | < 0.1% |
| 503740e9ca751ccdda7ba28e9ab8f608 | 1 | < 0.1% |
| ed0271e0b7da060a393796590e7b737a | 1 | < 0.1% |
| 9bdf08b4b3b52b5526ff42d37d47f222 | 1 | < 0.1% |
| f54a9f0e6b351c431402b8461ea51999 | 1 | < 0.1% |
| 31ad1d1b63eb9962463f764d4e6e0c9d | 1 | < 0.1% |
| 494dded5b201313c64ed7f100595b95c | 1 | < 0.1% |
| a166da34890074091a942054b36e4265 | 1 | < 0.1% |
| 7711cf624183d843aafe81855097bc37 | 1 | < 0.1% |
| Other values (99102) | 99102 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 198650 | 6.3% |
| 2 | 198612 | 6.3% |
| f | 198600 | 6.3% |
| c | 198574 | 6.3% |
| 1 | 198471 | 6.3% |
| 8 | 198465 | 6.3% |
| b | 198460 | 6.3% |
| 3 | 198401 | 6.3% |
| 7 | 198265 | 6.3% |
| e | 198086 | 6.2% |
| Other values (6) | 1187000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1981935 | |
| Lowercase Letter | 1189649 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 198650 | |
| 2 | 198612 | |
| 1 | 198471 | |
| 8 | 198465 | |
| 3 | 198401 | |
| 7 | 198265 | |
| 6 | 198083 | |
| 9 | 198032 | |
| 0 | 197652 | |
| 4 | 197304 |
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 198600 | |
| c | 198574 | |
| b | 198460 | |
| e | 198086 | |
| a | 198006 | |
| d | 197923 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1981935 | |
| Latin | 1189649 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 198650 | |
| 2 | 198612 | |
| 1 | 198471 | |
| 8 | 198465 | |
| 3 | 198401 | |
| 7 | 198265 | |
| 6 | 198083 | |
| 9 | 198032 | |
| 0 | 197652 | |
| 4 | 197304 |
Latin
| Value | Count | Frequency (%) |
| f | 198600 | |
| c | 198574 | |
| b | 198460 | |
| e | 198086 | |
| a | 198006 | |
| d | 197923 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3171584 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 198650 | 6.3% |
| 2 | 198612 | 6.3% |
| f | 198600 | 6.3% |
| c | 198574 | 6.3% |
| 1 | 198471 | 6.3% |
| 8 | 198465 | 6.3% |
| b | 198460 | 6.3% |
| 3 | 198401 | 6.3% |
| 7 | 198265 | 6.3% |
| e | 198086 | 6.2% |
| Other values (6) | 1187000 |
order_status
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| delivered | |
|---|---|
| shipped | 1098 |
| unavailable | 602 |
| canceled | 599 |
| processing | 299 |
| Other values (3) | 303 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.9838566 |
| Min length | 7 |
Characters and Unicode
| Total characters | 890408 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | delivered |
|---|---|
| 2nd row | delivered |
| 3rd row | delivered |
| 4th row | delivered |
| 5th row | delivered |
Common Values
| Value | Count | Frequency (%) |
| delivered | 96211 | |
| shipped | 1098 | 1.1% |
| unavailable | 602 | 0.6% |
| canceled | 599 | 0.6% |
| processing | 299 | 0.3% |
| invoiced | 296 | 0.3% |
| created | 5 | < 0.1% |
| approved | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| delivered | 96211 | |
| shipped | 1098 | 1.1% |
| unavailable | 602 | 0.6% |
| canceled | 599 | 0.6% |
| processing | 299 | 0.3% |
| invoiced | 296 | 0.3% |
| created | 5 | < 0.1% |
| approved | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 292138 | |
| d | 194422 | |
| i | 98802 | 11.1% |
| l | 98014 | 11.0% |
| v | 97111 | 10.9% |
| r | 96517 | 10.8% |
| p | 2499 | 0.3% |
| a | 2412 | 0.3% |
| c | 1798 | 0.2% |
| n | 1796 | 0.2% |
| Other values (7) | 4899 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 890408 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 292138 | |
| d | 194422 | |
| i | 98802 | 11.1% |
| l | 98014 | 11.0% |
| v | 97111 | 10.9% |
| r | 96517 | 10.8% |
| p | 2499 | 0.3% |
| a | 2412 | 0.3% |
| c | 1798 | 0.2% |
| n | 1796 | 0.2% |
| Other values (7) | 4899 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 890408 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 292138 | |
| d | 194422 | |
| i | 98802 | 11.1% |
| l | 98014 | 11.0% |
| v | 97111 | 10.9% |
| r | 96517 | 10.8% |
| p | 2499 | 0.3% |
| a | 2412 | 0.3% |
| c | 1798 | 0.2% |
| n | 1796 | 0.2% |
| Other values (7) | 4899 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 890408 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 292138 | |
| d | 194422 | |
| i | 98802 | 11.1% |
| l | 98014 | 11.0% |
| v | 97111 | 10.9% |
| r | 96517 | 10.8% |
| p | 2499 | 0.3% |
| a | 2412 | 0.3% |
| c | 1798 | 0.2% |
| n | 1796 | 0.2% |
| Other values (7) | 4899 | 0.6% |
| Distinct | 98546 |
|---|---|
| Distinct (%) | 99.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Minimum | 2017-01-05 11:56:06 |
|---|---|
| Maximum | 2018-10-17 17:30:18 |
order_approved_at
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 90411 |
|---|---|
| Distinct (%) | 91.4% |
| Missing | 154 |
| Missing (%) | 0.2% |
| Memory size | 1.5 MiB |
| 2018-02-27 04:31:10 | 9 |
|---|---|
| 2018-02-27 04:31:01 | 7 |
| 2018-07-05 16:33:01 | 7 |
| 2018-02-06 05:31:52 | 7 |
| 2018-01-10 10:32:03 | 7 |
| Other values (90406) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1880202 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 83367 ? |
|---|---|
| Unique (%) | 84.2% |
Sample
| 1st row | 2017-10-02 11:07:15 |
|---|---|
| 2nd row | 2018-07-26 03:24:27 |
| 3rd row | 2018-08-08 08:55:23 |
| 4th row | 2017-11-18 19:45:59 |
| 5th row | 2018-02-13 22:20:29 |
Common Values
| Value | Count | Frequency (%) |
| 2018-02-27 04:31:10 | 9 | < 0.1% |
| 2018-02-27 04:31:01 | 7 | < 0.1% |
| 2018-07-05 16:33:01 | 7 | < 0.1% |
| 2018-02-06 05:31:52 | 7 | < 0.1% |
| 2018-01-10 10:32:03 | 7 | < 0.1% |
| 2017-12-05 10:30:42 | 7 | < 0.1% |
| 2017-11-07 07:30:38 | 7 | < 0.1% |
| 2017-11-07 07:30:29 | 7 | < 0.1% |
| 2017-11-07 07:30:48 | 6 | < 0.1% |
| 2018-07-23 11:31:25 | 6 | < 0.1% |
| Other values (90401) | 98888 | |
| (Missing) | 154 | 0.2% |
Length
| Value | Count | Frequency (%) |
| 2018-04-24 | 990 | 0.5% |
| 2017-11-24 | 799 | 0.4% |
| 2017-11-25 | 754 | 0.4% |
| 2018-07-05 | 697 | 0.4% |
| 2017-11-28 | 506 | 0.3% |
| 2018-08-07 | 444 | 0.2% |
| 2017-12-05 | 426 | 0.2% |
| 2018-08-20 | 426 | 0.2% |
| 2018-05-08 | 426 | 0.2% |
| 2018-01-22 | 408 | 0.2% |
| Other values (42180) | 192040 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 317386 | |
| 1 | 304121 | |
| 2 | 240526 | |
| - | 197916 | |
| : | 197916 | |
| 98958 | 5.3% | |
| 8 | 98198 | 5.2% |
| 5 | 95335 | 5.1% |
| 3 | 92887 | 4.9% |
| 7 | 87855 | 4.7% |
| Other values (3) | 149104 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1385412 | |
| Dash Punctuation | 197916 | 10.5% |
| Other Punctuation | 197916 | 10.5% |
| Space Separator | 98958 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 317386 | |
| 1 | 304121 | |
| 2 | 240526 | |
| 8 | 98198 | 7.1% |
| 5 | 95335 | 6.9% |
| 3 | 92887 | 6.7% |
| 7 | 87855 | 6.3% |
| 4 | 68666 | 5.0% |
| 6 | 42186 | 3.0% |
| 9 | 38252 | 2.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 197916 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 197916 |
Space Separator
| Value | Count | Frequency (%) |
| 98958 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1880202 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 317386 | |
| 1 | 304121 | |
| 2 | 240526 | |
| - | 197916 | |
| : | 197916 | |
| 98958 | 5.3% | |
| 8 | 98198 | 5.2% |
| 5 | 95335 | 5.1% |
| 3 | 92887 | 4.9% |
| 7 | 87855 | 4.7% |
| Other values (3) | 149104 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1880202 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 317386 | |
| 1 | 304121 | |
| 2 | 240526 | |
| - | 197916 | |
| : | 197916 | |
| 98958 | 5.3% | |
| 8 | 98198 | 5.2% |
| 5 | 95335 | 5.1% |
| 3 | 92887 | 4.9% |
| 7 | 87855 | 4.7% |
| Other values (3) | 149104 |
order_delivered_carrier_date
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 80749 |
|---|---|
| Distinct (%) | 82.9% |
| Missing | 1735 |
| Missing (%) | 1.8% |
| Memory size | 1.5 MiB |
| 2018-05-09 15:48:00 | 47 |
|---|---|
| 2018-05-10 18:29:00 | 32 |
| 2018-05-07 12:31:00 | 21 |
| 2018-05-02 15:15:00 | 16 |
| 2018-07-24 16:07:00 | 16 |
| Other values (80744) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1850163 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 70660 ? |
|---|---|
| Unique (%) | 72.6% |
Sample
| 1st row | 2017-10-04 19:55:00 |
|---|---|
| 2nd row | 2018-07-26 14:31:00 |
| 3rd row | 2018-08-08 13:50:00 |
| 4th row | 2017-11-22 13:39:59 |
| 5th row | 2018-02-14 19:46:34 |
Common Values
| Value | Count | Frequency (%) |
| 2018-05-09 15:48:00 | 47 | < 0.1% |
| 2018-05-10 18:29:00 | 32 | < 0.1% |
| 2018-05-07 12:31:00 | 21 | < 0.1% |
| 2018-05-02 15:15:00 | 16 | < 0.1% |
| 2018-07-24 16:07:00 | 16 | < 0.1% |
| 2018-07-17 14:16:00 | 15 | < 0.1% |
| 2018-05-16 13:44:00 | 15 | < 0.1% |
| 2018-08-03 15:10:00 | 15 | < 0.1% |
| 2018-08-08 15:01:00 | 15 | < 0.1% |
| 2018-06-08 14:40:00 | 14 | < 0.1% |
| Other values (80739) | 97171 | |
| (Missing) | 1735 | 1.8% |
Length
| Value | Count | Frequency (%) |
| 2017-11-28 | 707 | 0.4% |
| 2017-11-27 | 673 | 0.3% |
| 2017-11-29 | 566 | 0.3% |
| 2018-02-27 | 523 | 0.3% |
| 2018-03-27 | 511 | 0.3% |
| 2018-08-06 | 510 | 0.3% |
| 2017-11-30 | 489 | 0.3% |
| 2018-08-13 | 472 | 0.2% |
| 2018-05-15 | 451 | 0.2% |
| 2018-05-03 | 450 | 0.2% |
| Other values (37379) | 189402 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 338077 | |
| 1 | 287701 | |
| 2 | 229707 | |
| - | 194754 | |
| : | 194754 | |
| 8 | 103071 | 5.6% |
| 97377 | 5.3% | |
| 7 | 88674 | 4.8% |
| 3 | 81737 | 4.4% |
| 4 | 76783 | 4.2% |
| Other values (3) | 157528 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1363278 | |
| Dash Punctuation | 194754 | 10.5% |
| Other Punctuation | 194754 | 10.5% |
| Space Separator | 97377 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 338077 | |
| 1 | 287701 | |
| 2 | 229707 | |
| 8 | 103071 | 7.6% |
| 7 | 88674 | 6.5% |
| 3 | 81737 | 6.0% |
| 4 | 76783 | 5.6% |
| 5 | 74515 | 5.5% |
| 6 | 42553 | 3.1% |
| 9 | 40460 | 3.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 194754 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 194754 |
Space Separator
| Value | Count | Frequency (%) |
| 97377 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1850163 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 338077 | |
| 1 | 287701 | |
| 2 | 229707 | |
| - | 194754 | |
| : | 194754 | |
| 8 | 103071 | 5.6% |
| 97377 | 5.3% | |
| 7 | 88674 | 4.8% |
| 3 | 81737 | 4.4% |
| 4 | 76783 | 4.2% |
| Other values (3) | 157528 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1850163 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 338077 | |
| 1 | 287701 | |
| 2 | 229707 | |
| - | 194754 | |
| : | 194754 | |
| 8 | 103071 | 5.6% |
| 97377 | 5.3% | |
| 7 | 88674 | 4.8% |
| 3 | 81737 | 4.4% |
| 4 | 76783 | 4.2% |
| Other values (3) | 157528 |
order_delivered_customer_date
Categorical
HIGH CARDINALITY  MISSING  UNIFORM 
| Distinct | 95394 |
|---|---|
| Distinct (%) | 99.2% |
| Missing | 2908 |
| Missing (%) | 2.9% |
| Memory size | 1.5 MiB |
| 2018-05-14 20:02:44 | 3 |
|---|---|
| 2018-05-08 23:38:46 | 3 |
| 2018-05-08 19:36:48 | 3 |
| 2018-02-14 21:09:19 | 3 |
| 2017-06-19 18:47:51 | 3 |
| Other values (95389) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1827876 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 94591 ? |
|---|---|
| Unique (%) | 98.3% |
Sample
| 1st row | 2017-10-10 21:25:13 |
|---|---|
| 2nd row | 2018-08-07 15:27:45 |
| 3rd row | 2018-08-17 18:06:29 |
| 4th row | 2017-12-02 00:28:42 |
| 5th row | 2018-02-16 18:17:02 |
Common Values
| Value | Count | Frequency (%) |
| 2018-05-14 20:02:44 | 3 | < 0.1% |
| 2018-05-08 23:38:46 | 3 | < 0.1% |
| 2018-05-08 19:36:48 | 3 | < 0.1% |
| 2018-02-14 21:09:19 | 3 | < 0.1% |
| 2017-06-19 18:47:51 | 3 | < 0.1% |
| 2017-12-02 00:26:45 | 3 | < 0.1% |
| 2018-07-24 21:36:42 | 3 | < 0.1% |
| 2017-12-06 18:30:10 | 2 | < 0.1% |
| 2018-02-01 20:29:51 | 2 | < 0.1% |
| 2018-04-23 16:45:44 | 2 | < 0.1% |
| Other values (95384) | 96177 | |
| (Missing) | 2908 | 2.9% |
Length
| Value | Count | Frequency (%) |
| 2018-08-27 | 446 | 0.2% |
| 2018-08-13 | 442 | 0.2% |
| 2018-05-14 | 434 | 0.2% |
| 2018-05-21 | 431 | 0.2% |
| 2018-05-18 | 425 | 0.2% |
| 2018-04-11 | 413 | 0.2% |
| 2017-12-11 | 412 | 0.2% |
| 2018-07-03 | 410 | 0.2% |
| 2018-05-03 | 409 | 0.2% |
| 2017-06-19 | 405 | 0.2% |
| Other values (41614) | 188181 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 281486 | |
| 0 | 280536 | |
| 2 | 243070 | |
| - | 192408 | |
| : | 192408 | |
| 8 | 113401 | |
| 96204 | 5.3% | |
| 3 | 88895 | 4.9% |
| 7 | 88844 | 4.9% |
| 4 | 83193 | 4.6% |
| Other values (3) | 167431 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1346856 | |
| Dash Punctuation | 192408 | 10.5% |
| Other Punctuation | 192408 | 10.5% |
| Space Separator | 96204 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 281486 | |
| 0 | 280536 | |
| 2 | 243070 | |
| 8 | 113401 | |
| 3 | 88895 | 6.6% |
| 7 | 88844 | 6.6% |
| 4 | 83193 | 6.2% |
| 5 | 77959 | 5.8% |
| 6 | 47877 | 3.6% |
| 9 | 41595 | 3.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 192408 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 192408 |
Space Separator
| Value | Count | Frequency (%) |
| 96204 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1827876 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 281486 | |
| 0 | 280536 | |
| 2 | 243070 | |
| - | 192408 | |
| : | 192408 | |
| 8 | 113401 | |
| 96204 | 5.3% | |
| 3 | 88895 | 4.9% |
| 7 | 88844 | 4.9% |
| 4 | 83193 | 4.6% |
| Other values (3) | 167431 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1827876 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 281486 | |
| 0 | 280536 | |
| 2 | 243070 | |
| - | 192408 | |
| : | 192408 | |
| 8 | 113401 | |
| 96204 | 5.3% | |
| 3 | 88895 | 4.9% |
| 7 | 88844 | 4.9% |
| 4 | 83193 | 4.6% |
| Other values (3) | 167431 |
order_estimated_delivery_date
Categorical
| Distinct | 423 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 2017-12-20 00:00:00 | 522 |
|---|---|
| 2018-03-12 00:00:00 | 516 |
| 2018-03-13 00:00:00 | 513 |
| 2018-05-29 00:00:00 | 513 |
| 2018-02-14 00:00:00 | 507 |
| Other values (418) |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Characters and Unicode
| Total characters | 1883128 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2017-10-18 00:00:00 |
|---|---|
| 2nd row | 2018-08-13 00:00:00 |
| 3rd row | 2018-09-04 00:00:00 |
| 4th row | 2017-12-15 00:00:00 |
| 5th row | 2018-02-26 00:00:00 |
Common Values
| Value | Count | Frequency (%) |
| 2017-12-20 00:00:00 | 522 | 0.5% |
| 2018-03-12 00:00:00 | 516 | 0.5% |
| 2018-03-13 00:00:00 | 513 | 0.5% |
| 2018-05-29 00:00:00 | 513 | 0.5% |
| 2018-02-14 00:00:00 | 507 | 0.5% |
| 2017-12-18 00:00:00 | 493 | 0.5% |
| 2018-05-28 00:00:00 | 492 | 0.5% |
| 2018-03-06 00:00:00 | 492 | 0.5% |
| 2018-02-06 00:00:00 | 491 | 0.5% |
| 2018-04-12 00:00:00 | 490 | 0.5% |
| Other values (413) | 94083 |
Length
| Value | Count | Frequency (%) |
| 00:00:00 | 99112 | |
| 2017-12-20 | 522 | 0.3% |
| 2018-03-12 | 516 | 0.3% |
| 2018-03-13 | 513 | 0.3% |
| 2018-05-29 | 513 | 0.3% |
| 2018-02-14 | 507 | 0.3% |
| 2017-12-18 | 493 | 0.2% |
| 2018-05-28 | 492 | 0.2% |
| 2018-03-06 | 492 | 0.2% |
| 2018-02-06 | 491 | 0.2% |
| Other values (414) | 94573 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 820140 | |
| - | 198224 | 10.5% |
| : | 198224 | 10.5% |
| 1 | 169147 | 9.0% |
| 2 | 154439 | 8.2% |
| 99112 | 5.3% | |
| 8 | 82589 | 4.4% |
| 7 | 60284 | 3.2% |
| 3 | 26556 | 1.4% |
| 5 | 20357 | 1.1% |
| Other values (3) | 54056 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1387568 | |
| Dash Punctuation | 198224 | 10.5% |
| Other Punctuation | 198224 | 10.5% |
| Space Separator | 99112 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 820140 | |
| 1 | 169147 | 12.2% |
| 2 | 154439 | 11.1% |
| 8 | 82589 | 6.0% |
| 7 | 60284 | 4.3% |
| 3 | 26556 | 1.9% |
| 5 | 20357 | 1.5% |
| 6 | 19001 | 1.4% |
| 4 | 18580 | 1.3% |
| 9 | 16475 | 1.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 198224 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 198224 |
Space Separator
| Value | Count | Frequency (%) |
| 99112 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1883128 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 820140 | |
| - | 198224 | 10.5% |
| : | 198224 | 10.5% |
| 1 | 169147 | 9.0% |
| 2 | 154439 | 8.2% |
| 99112 | 5.3% | |
| 8 | 82589 | 4.4% |
| 7 | 60284 | 3.2% |
| 3 | 26556 | 1.4% |
| 5 | 20357 | 1.1% |
| Other values (3) | 54056 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1883128 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 820140 | |
| - | 198224 | 10.5% |
| : | 198224 | 10.5% |
| 1 | 169147 | 9.0% |
| 2 | 154439 | 8.2% |
| 99112 | 5.3% | |
| 8 | 82589 | 4.4% |
| 7 | 60284 | 3.2% |
| 3 | 26556 | 1.4% |
| 5 | 20357 | 1.1% |
| Other values (3) | 54056 | 2.9% |
review_score
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 763 |
| Missing (%) | 0.8% |
| Memory size | 1.5 MiB |
| 5.0 | |
|---|---|
| 4.0 | |
| 1.0 | |
| 3.0 | |
| 2.0 | 3123 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 295047 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4.0 |
|---|---|
| 2nd row | 4.0 |
| 3rd row | 5.0 |
| 4th row | 5.0 |
| 5th row | 5.0 |
Common Values
| Value | Count | Frequency (%) |
| 5.0 | 56853 | |
| 4.0 | 18987 | 19.2% |
| 1.0 | 11274 | 11.4% |
| 3.0 | 8112 | 8.2% |
| 2.0 | 3123 | 3.2% |
| (Missing) | 763 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 5.0 | 56853 | |
| 4.0 | 18987 | 19.3% |
| 1.0 | 11274 | 11.5% |
| 3.0 | 8112 | 8.2% |
| 2.0 | 3123 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 98349 | |
| 0 | 98349 | |
| 5 | 56853 | |
| 4 | 18987 | 6.4% |
| 1 | 11274 | 3.8% |
| 3 | 8112 | 2.7% |
| 2 | 3123 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 196698 | |
| Other Punctuation | 98349 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 98349 | |
| 5 | 56853 | |
| 4 | 18987 | 9.7% |
| 1 | 11274 | 5.7% |
| 3 | 8112 | 4.1% |
| 2 | 3123 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 98349 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 295047 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 98349 | |
| 0 | 98349 | |
| 5 | 56853 | |
| 4 | 18987 | 6.4% |
| 1 | 11274 | 3.8% |
| 3 | 8112 | 2.7% |
| 2 | 3123 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 295047 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 98349 | |
| 0 | 98349 | |
| 5 | 56853 | |
| 4 | 18987 | 6.4% |
| 1 | 11274 | 3.8% |
| 3 | 8112 | 2.7% |
| 2 | 3123 | 1.1% |
length_comment_title
Real number (ℝ)
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 763 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3829017 |
| Minimum | 0 |
|---|---|
| Maximum | 26 |
| Zeros | 86798 |
| Zeros (%) | 87.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 12 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.3648976 |
|---|---|
| Coefficient of variation (CV) | 3.1563325 |
| Kurtosis | 11.62685 |
| Mean | 1.3829017 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.4511124 |
| Sum | 136007 |
| Variance | 19.052331 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 86798 | |
| 9 | 2000 | 2.0% |
| 5 | 1117 | 1.1% |
| 15 | 870 | 0.9% |
| 3 | 703 | 0.7% |
| 10 | 575 | 0.6% |
| 13 | 489 | 0.5% |
| 17 | 482 | 0.5% |
| 25 | 429 | 0.4% |
| 14 | 399 | 0.4% |
| Other values (17) | 4487 | 4.5% |
| (Missing) | 763 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 86798 | |
| 1 | 164 | 0.2% |
| 2 | 253 | 0.3% |
| 3 | 703 | 0.7% |
| 4 | 178 | 0.2% |
| 5 | 1117 | 1.1% |
| 6 | 243 | 0.2% |
| 7 | 388 | 0.4% |
| 8 | 342 | 0.3% |
| 9 | 2000 | 2.0% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 25 | 429 | |
| 24 | 221 | |
| 23 | 213 | |
| 22 | 213 | |
| 21 | 239 | |
| 20 | 390 | |
| 19 | 268 | |
| 18 | 301 | |
| 17 | 482 |
length_comment_message
Real number (ℝ)
| Distinct | 209 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 763 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.152091 |
| Minimum | 0 |
|---|---|
| Maximum | 208 |
| Zeros | 57807 |
| Zeros (%) | 58.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 41 |
| 95-th percentile | 146 |
| Maximum | 208 |
| Range | 208 |
| Interquartile range (IQR) | 41 |
Descriptive statistics
| Standard deviation | 48.19114 |
|---|---|
| Coefficient of variation (CV) | 1.7118139 |
| Kurtosis | 3.346744 |
| Mean | 28.152091 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.9890494 |
| Sum | 2768730 |
| Variance | 2322.3859 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 57807 | |
| 9 | 1003 | 1.0% |
| 200 | 589 | 0.6% |
| 5 | 556 | 0.6% |
| 3 | 514 | 0.5% |
| 26 | 499 | 0.5% |
| 20 | 473 | 0.5% |
| 10 | 469 | 0.5% |
| 34 | 460 | 0.5% |
| 31 | 448 | 0.5% |
| Other values (199) | 35531 | |
| (Missing) | 763 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 57807 | |
| 1 | 97 | 0.1% |
| 2 | 196 | 0.2% |
| 3 | 514 | 0.5% |
| 4 | 101 | 0.1% |
| 5 | 556 | 0.6% |
| 6 | 205 | 0.2% |
| 7 | 235 | 0.2% |
| 8 | 244 | 0.2% |
| 9 | 1003 | 1.0% |
| Value | Count | Frequency (%) |
| 208 | 1 | < 0.1% |
| 207 | 1 | < 0.1% |
| 206 | 1 | < 0.1% |
| 205 | 1 | < 0.1% |
| 204 | 14 | < 0.1% |
| 203 | 17 | < 0.1% |
| 202 | 12 | < 0.1% |
| 201 | 22 | < 0.1% |
| 200 | 589 | |
| 199 | 333 |
| Distinct | 97644 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 763 |
| Missing (%) | 0.8% |
| Memory size | 1.5 MiB |
| Minimum | 2017-01-13 20:22:46 |
|---|---|
| Maximum | 2018-10-29 12:27:35 |
payment_type
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| credit_card | |
|---|---|
| boleto | |
| credit_card,voucher | 2240 |
| voucher | 1615 |
| debit_card | 1525 |
| Other values (2) | 4 |
Length
| Max length | 22 |
|---|---|
| Median length | 11 |
| Mean length | 10.105467 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1001573 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | credit_card,voucher |
|---|---|
| 2nd row | boleto |
| 3rd row | credit_card |
| 4th row | credit_card |
| 5th row | credit_card |
Common Values
| Value | Count | Frequency (%) |
| credit_card | 74007 | |
| boleto | 19721 | 19.9% |
| credit_card,voucher | 2240 | 2.3% |
| voucher | 1615 | 1.6% |
| debit_card | 1525 | 1.5% |
| not_defined | 3 | < 0.1% |
| credit_card,debit_card | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| credit_card | 74007 | |
| boleto | 19721 | 19.9% |
| credit_card,voucher | 2240 | 2.3% |
| voucher | 1615 | 1.6% |
| debit_card | 1525 | 1.5% |
| not_defined | 3 | < 0.1% |
| credit_card,debit_card | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 157877 | |
| r | 157877 | |
| d | 155554 | |
| e | 101356 | |
| t | 97498 | |
| i | 77777 | |
| _ | 77777 | |
| a | 77774 | |
| o | 43300 | 4.3% |
| b | 21247 | 2.1% |
| Other values (7) | 33536 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 921555 | |
| Connector Punctuation | 77777 | 7.8% |
| Other Punctuation | 2241 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 157877 | |
| r | 157877 | |
| d | 155554 | |
| e | 101356 | |
| t | 97498 | |
| i | 77777 | |
| a | 77774 | |
| o | 43300 | 4.7% |
| b | 21247 | 2.3% |
| l | 19721 | 2.1% |
| Other values (5) | 11574 | 1.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 77777 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2241 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 921555 | |
| Common | 80018 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 157877 | |
| r | 157877 | |
| d | 155554 | |
| e | 101356 | |
| t | 97498 | |
| i | 77777 | |
| a | 77774 | |
| o | 43300 | 4.7% |
| b | 21247 | 2.3% |
| l | 19721 | 2.1% |
| Other values (5) | 11574 | 1.3% |
Common
| Value | Count | Frequency (%) |
| _ | 77777 | |
| , | 2241 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1001573 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 157877 | |
| r | 157877 | |
| d | 155554 | |
| e | 101356 | |
| t | 97498 | |
| i | 77777 | |
| _ | 77777 | |
| a | 77774 | |
| o | 43300 | 4.3% |
| b | 21247 | 2.1% |
| Other values (7) | 33536 | 3.3% |
payment_installments
Real number (ℝ)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.928152 |
| Minimum | 0 |
|---|---|
| Maximum | 24 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.7140631 |
|---|---|
| Coefficient of variation (CV) | 0.92688602 |
| Kurtosis | 2.3666857 |
| Mean | 2.928152 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.6018438 |
| Sum | 290215 |
| Variance | 7.3661387 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 48140 | |
| 2 | 12332 | 12.4% |
| 3 | 10385 | 10.5% |
| 4 | 7044 | 7.1% |
| 10 | 5273 | 5.3% |
| 5 | 5207 | 5.3% |
| 8 | 4248 | 4.3% |
| 6 | 3890 | 3.9% |
| 7 | 1609 | 1.6% |
| 9 | 641 | 0.6% |
| Other values (14) | 343 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 48140 | |
| 2 | 12332 | 12.4% |
| 3 | 10385 | 10.5% |
| 4 | 7044 | 7.1% |
| 5 | 5207 | 5.3% |
| 6 | 3890 | 3.9% |
| 7 | 1609 | 1.6% |
| 8 | 4248 | 4.3% |
| 9 | 641 | 0.6% |
| Value | Count | Frequency (%) |
| 24 | 18 | < 0.1% |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 21 | 3 | < 0.1% |
| 20 | 17 | < 0.1% |
| 18 | 27 | < 0.1% |
| 17 | 8 | < 0.1% |
| 16 | 5 | < 0.1% |
| 15 | 74 | |
| 14 | 15 | < 0.1% |
payment_value
Real number (ℝ)
| Distinct | 27892 |
|---|---|
| Distinct (%) | 28.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 160.9241 |
| Minimum | 0 |
|---|---|
| Maximum | 13664.08 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 32.38 |
| Q1 | 62.0075 |
| median | 105.28 |
| Q3 | 176.88 |
| 95-th percentile | 452.2845 |
| Maximum | 13664.08 |
| Range | 13664.08 |
| Interquartile range (IQR) | 114.8725 |
Descriptive statistics
| Standard deviation | 222.00748 |
|---|---|
| Coefficient of variation (CV) | 1.3795788 |
| Kurtosis | 233.92317 |
| Mean | 160.9241 |
| Median Absolute Deviation (MAD) | 51.6 |
| Skewness | 9.1659713 |
| Sum | 15949510 |
| Variance | 49287.321 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 77.57 | 254 | 0.3% |
| 35 | 169 | 0.2% |
| 73.34 | 163 | 0.2% |
| 116.94 | 132 | 0.1% |
| 56.78 | 124 | 0.1% |
| 107.78 | 121 | 0.1% |
| 65 | 117 | 0.1% |
| 86.15 | 107 | 0.1% |
| 99.9 | 106 | 0.1% |
| 67.5 | 105 | 0.1% |
| Other values (27882) | 97714 |
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 9.59 | 1 | < 0.1% |
| 10.07 | 1 | < 0.1% |
| 10.89 | 1 | < 0.1% |
| 11.56 | 1 | < 0.1% |
| 11.62 | 1 | < 0.1% |
| 11.63 | 2 | |
| 12.22 | 1 | < 0.1% |
| 12.28 | 1 | < 0.1% |
| 12.39 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 13664.08 | 1 | |
| 7274.88 | 1 | |
| 6929.31 | 1 | |
| 6922.21 | 1 | |
| 6726.66 | 1 | |
| 6081.54 | 1 | |
| 4950.34 | 1 | |
| 4809.44 | 1 | |
| 4764.34 | 1 | |
| 4681.78 | 1 |
product_most_frequent
Categorical
| Distinct | 31695 |
|---|---|
| Distinct (%) | 32.2% |
| Missing | 758 |
| Missing (%) | 0.8% |
| Memory size | 1.5 MiB |
| aca2eb7d00ea1a7b8ebd4e68314663af | 429 |
|---|---|
| 99a4788cb24856965c36a24e339b6058 | 427 |
| 422879e10f46682990de24d770e7f83d | 339 |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 311 |
| 53b36df67ebb7c41585e8d54d6772e08 | 303 |
| Other values (31690) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3147328 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18852 ? |
|---|---|
| Unique (%) | 19.2% |
Sample
| 1st row | 87285b34884572647811a353c7ac498a |
|---|---|
| 2nd row | 595fac2a385ac33a80bd5114aec74eb8 |
| 3rd row | aa4383b373c6aca5d8797843e5594415 |
| 4th row | d0b61bfb1de832b15ba9d266ca96e5b0 |
| 5th row | 65266b2da20d04dbe00c5c2d3bb7859e |
Common Values
| Value | Count | Frequency (%) |
| aca2eb7d00ea1a7b8ebd4e68314663af | 429 | 0.4% |
| 99a4788cb24856965c36a24e339b6058 | 427 | 0.4% |
| 422879e10f46682990de24d770e7f83d | 339 | 0.3% |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 311 | 0.3% |
| 53b36df67ebb7c41585e8d54d6772e08 | 303 | 0.3% |
| 389d119b48cf3043d311335e499d9c6b | 299 | 0.3% |
| 368c6c730842d78016ad823897a372db | 285 | 0.3% |
| 154e7e31ebfa092203795c972e5804a6 | 269 | 0.3% |
| 53759a2ecddad2bb87a079a1f1519f73 | 264 | 0.3% |
| 2b4609f8948be18874494203496bc318 | 258 | 0.3% |
| Other values (31685) | 95170 | |
| (Missing) | 758 | 0.8% |
Length
| Value | Count | Frequency (%) |
| aca2eb7d00ea1a7b8ebd4e68314663af | 429 | 0.4% |
| 99a4788cb24856965c36a24e339b6058 | 427 | 0.4% |
| 422879e10f46682990de24d770e7f83d | 339 | 0.3% |
| d1c427060a0f73f6b889a5c7c61f2ac4 | 311 | 0.3% |
| 53b36df67ebb7c41585e8d54d6772e08 | 303 | 0.3% |
| 389d119b48cf3043d311335e499d9c6b | 299 | 0.3% |
| 368c6c730842d78016ad823897a372db | 285 | 0.3% |
| 154e7e31ebfa092203795c972e5804a6 | 269 | 0.3% |
| 53759a2ecddad2bb87a079a1f1519f73 | 264 | 0.3% |
| 2b4609f8948be18874494203496bc318 | 258 | 0.3% |
| Other values (31685) | 95170 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 202346 | 6.4% |
| 9 | 199979 | 6.4% |
| 8 | 198473 | 6.3% |
| e | 198357 | 6.3% |
| 7 | 197610 | 6.3% |
| 4 | 197555 | 6.3% |
| a | 197524 | 6.3% |
| 0 | 197277 | 6.3% |
| c | 196851 | 6.3% |
| 5 | 196311 | 6.2% |
| Other values (6) | 1165045 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1976192 | |
| Lowercase Letter | 1171136 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 202346 | |
| 9 | 199979 | |
| 8 | 198473 | |
| 7 | 197610 | |
| 4 | 197555 | |
| 0 | 197277 | |
| 5 | 196311 | |
| 2 | 196289 | |
| 6 | 195418 | |
| 1 | 194934 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 198357 | |
| a | 197524 | |
| c | 196851 | |
| b | 194783 | |
| d | 193184 | |
| f | 190437 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1976192 | |
| Latin | 1171136 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 202346 | |
| 9 | 199979 | |
| 8 | 198473 | |
| 7 | 197610 | |
| 4 | 197555 | |
| 0 | 197277 | |
| 5 | 196311 | |
| 2 | 196289 | |
| 6 | 195418 | |
| 1 | 194934 |
Latin
| Value | Count | Frequency (%) |
| e | 198357 | |
| a | 197524 | |
| c | 196851 | |
| b | 194783 | |
| d | 193184 | |
| f | 190437 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3147328 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 202346 | 6.4% |
| 9 | 199979 | 6.4% |
| 8 | 198473 | 6.3% |
| e | 198357 | 6.3% |
| 7 | 197610 | 6.3% |
| 4 | 197555 | 6.3% |
| a | 197524 | 6.3% |
| 0 | 197277 | 6.3% |
| c | 196851 | 6.3% |
| 5 | 196311 | 6.2% |
| Other values (6) | 1165045 |
nb_items
Real number (ℝ)
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 758 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1415906 |
| Minimum | 1 |
|---|---|
| Maximum | 21 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 21 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.53809943 |
|---|---|
| Coefficient of variation (CV) | 0.47135938 |
| Kurtosis | 115.34737 |
| Mean | 1.1415906 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.5401727 |
| Sum | 112280 |
| Variance | 0.289551 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 88587 | |
| 2 | 7491 | 7.6% |
| 3 | 1317 | 1.3% |
| 4 | 502 | 0.5% |
| 5 | 203 | 0.2% |
| 6 | 196 | 0.2% |
| 7 | 22 | < 0.1% |
| 10 | 8 | < 0.1% |
| 8 | 8 | < 0.1% |
| 12 | 5 | < 0.1% |
| Other values (7) | 15 | < 0.1% |
| (Missing) | 758 | 0.8% |
| Value | Count | Frequency (%) |
| 1 | 88587 | |
| 2 | 7491 | 7.6% |
| 3 | 1317 | 1.3% |
| 4 | 502 | 0.5% |
| 5 | 203 | 0.2% |
| 6 | 196 | 0.2% |
| 7 | 22 | < 0.1% |
| 8 | 8 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 2 | < 0.1% |
| 13 | 1 | < 0.1% |
| 12 | 5 | |
| 11 | 4 | |
| 10 | 8 | |
| 9 | 3 | < 0.1% |
| 8 | 8 |
sum_price
Real number (ℝ)
| Distinct | 7751 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 758 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 137.68487 |
| Minimum | 0.85 |
|---|---|
| Maximum | 13440 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0.85 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 45.9 |
| median | 86.9 |
| Q3 | 149.9 |
| 95-th percentile | 399.9 |
| Maximum | 13440 |
| Range | 13439.15 |
| Interquartile range (IQR) | 104 |
Descriptive statistics
| Standard deviation | 210.68338 |
|---|---|
| Coefficient of variation (CV) | 1.5301854 |
| Kurtosis | 266.69704 |
| Mean | 137.68487 |
| Median Absolute Deviation (MAD) | 47.9 |
| Skewness | 9.7453475 |
| Sum | 13541858 |
| Variance | 44387.486 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 59.9 | 1712 | 1.7% |
| 69.9 | 1601 | 1.6% |
| 49.9 | 1411 | 1.4% |
| 89.9 | 1241 | 1.3% |
| 99.9 | 1186 | 1.2% |
| 79.9 | 1004 | 1.0% |
| 39.9 | 975 | 1.0% |
| 29.9 | 958 | 1.0% |
| 19.9 | 911 | 0.9% |
| 29.99 | 870 | 0.9% |
| Other values (7741) | 86485 |
| Value | Count | Frequency (%) |
| 0.85 | 2 | |
| 2.2 | 1 | < 0.1% |
| 2.29 | 1 | < 0.1% |
| 2.9 | 1 | < 0.1% |
| 2.99 | 1 | < 0.1% |
| 3 | 2 | |
| 3.49 | 1 | < 0.1% |
| 3.5 | 2 | |
| 3.54 | 1 | < 0.1% |
| 3.85 | 3 |
| Value | Count | Frequency (%) |
| 13440 | 1 | |
| 7160 | 1 | |
| 6735 | 1 | |
| 6729 | 1 | |
| 6499 | 1 | |
| 5934.6 | 1 | |
| 4799 | 1 | |
| 4690 | 1 | |
| 4599.9 | 1 | |
| 4590 | 1 |
sum_freight_value
Real number (ℝ)
| Distinct | 7954 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 758 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.820752 |
| Minimum | 0 |
|---|---|
| Maximum | 1794.96 |
| Zeros | 338 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.88 |
| Q1 | 13.85 |
| median | 17.17 |
| Q3 | 24.03 |
| 95-th percentile | 54.92 |
| Maximum | 1794.96 |
| Range | 1794.96 |
| Interquartile range (IQR) | 10.18 |
Descriptive statistics
| Standard deviation | 21.656041 |
|---|---|
| Coefficient of variation (CV) | 0.94896265 |
| Kurtosis | 566.55997 |
| Mean | 22.820752 |
| Median Absolute Deviation (MAD) | 4.38 |
| Skewness | 12.072998 |
| Sum | 2244512.2 |
| Variance | 468.98413 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15.1 | 2952 | 3.0% |
| 7.78 | 1839 | 1.9% |
| 14.1 | 1529 | 1.5% |
| 11.85 | 1444 | 1.5% |
| 18.23 | 1219 | 1.2% |
| 7.39 | 1137 | 1.1% |
| 15.23 | 823 | 0.8% |
| 16.11 | 794 | 0.8% |
| 8.72 | 761 | 0.8% |
| 16.79 | 697 | 0.7% |
| Other values (7944) | 85159 | |
| (Missing) | 758 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 338 | |
| 5.7 | 1 | < 0.1% |
| 5.82 | 1 | < 0.1% |
| 5.88 | 2 | < 0.1% |
| 6.52 | 1 | < 0.1% |
| 6.53 | 2 | < 0.1% |
| 6.56 | 1 | < 0.1% |
| 6.57 | 5 | < 0.1% |
| 6.78 | 5 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1794.96 | 1 | |
| 1002.29 | 1 | |
| 711.33 | 1 | |
| 626.64 | 1 | |
| 502.98 | 1 | |
| 497.42 | 1 | |
| 497.08 | 1 | |
| 479.28 | 1 | |
| 458.73 | 1 | |
| 456.47 | 1 |
customer_unique_id
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 95780 |
|---|---|
| Distinct (%) | 96.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 17 |
|---|---|
| 3e43e6105506432c953e165fb2acf44c | 9 |
| 1b6c7548a2a1f9037c1fd3ddfed95f33 | 7 |
| 6469f99c1f9dfae7733b25662e7f1782 | 7 |
| ca77025e7201e3b30c44b472ff346268 | 7 |
| Other values (95775) |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 32 |
| Min length | 32 |
Characters and Unicode
| Total characters | 3171584 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 92795 ? |
|---|---|
| Unique (%) | 93.6% |
Sample
| 1st row | 7c396fd4830fd04220f754e42b4e5bff |
|---|---|
| 2nd row | af07308b275d755c9edb36a90c618231 |
| 3rd row | 3a653a41f6f9fc3d2a113cf8398680e8 |
| 4th row | 7c142cf63193a1473d2e66489a9ae977 |
| 5th row | 72632f0f9dd73dfee390c9b22eb56dd6 |
Common Values
| Value | Count | Frequency (%) |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 17 | < 0.1% |
| 3e43e6105506432c953e165fb2acf44c | 9 | < 0.1% |
| 1b6c7548a2a1f9037c1fd3ddfed95f33 | 7 | < 0.1% |
| 6469f99c1f9dfae7733b25662e7f1782 | 7 | < 0.1% |
| ca77025e7201e3b30c44b472ff346268 | 7 | < 0.1% |
| f0e310a6839dce9de1638e0fe5ab282a | 6 | < 0.1% |
| de34b16117594161a6a89c50b289d35a | 6 | < 0.1% |
| dc813062e0fc23409cd255f7f53c7074 | 6 | < 0.1% |
| 12f5d6e1cbf93dafd9dcc19095df0b3d | 6 | < 0.1% |
| 47c1a3033b8b77b3ab6e109eb4d5fdf3 | 6 | < 0.1% |
| Other values (95770) | 99035 |
Length
| Value | Count | Frequency (%) |
| 8d50f5eadf50201ccdcedfb9e2ac8455 | 17 | < 0.1% |
| 3e43e6105506432c953e165fb2acf44c | 9 | < 0.1% |
| 1b6c7548a2a1f9037c1fd3ddfed95f33 | 7 | < 0.1% |
| ca77025e7201e3b30c44b472ff346268 | 7 | < 0.1% |
| 6469f99c1f9dfae7733b25662e7f1782 | 7 | < 0.1% |
| f0e310a6839dce9de1638e0fe5ab282a | 6 | < 0.1% |
| de34b16117594161a6a89c50b289d35a | 6 | < 0.1% |
| dc813062e0fc23409cd255f7f53c7074 | 6 | < 0.1% |
| 12f5d6e1cbf93dafd9dcc19095df0b3d | 6 | < 0.1% |
| 47c1a3033b8b77b3ab6e109eb4d5fdf3 | 6 | < 0.1% |
| Other values (95770) | 99035 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 198691 | 6.3% |
| 8 | 198678 | 6.3% |
| 1 | 198671 | 6.3% |
| a | 198460 | 6.3% |
| d | 198441 | 6.3% |
| b | 198367 | 6.3% |
| 0 | 198366 | 6.3% |
| 5 | 198350 | 6.3% |
| 2 | 198265 | 6.3% |
| e | 198253 | 6.3% |
| Other values (6) | 1187042 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1982483 | |
| Lowercase Letter | 1189101 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 198691 | |
| 8 | 198678 | |
| 1 | 198671 | |
| 0 | 198366 | |
| 5 | 198350 | |
| 2 | 198265 | |
| 9 | 198203 | |
| 3 | 197987 | |
| 4 | 197737 | |
| 7 | 197535 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 198460 | |
| d | 198441 | |
| b | 198367 | |
| e | 198253 | |
| f | 197959 | |
| c | 197621 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1982483 | |
| Latin | 1189101 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 198691 | |
| 8 | 198678 | |
| 1 | 198671 | |
| 0 | 198366 | |
| 5 | 198350 | |
| 2 | 198265 | |
| 9 | 198203 | |
| 3 | 197987 | |
| 4 | 197737 | |
| 7 | 197535 |
Latin
| Value | Count | Frequency (%) |
| a | 198460 | |
| d | 198441 | |
| b | 198367 | |
| e | 198253 | |
| f | 197959 | |
| c | 197621 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3171584 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 198691 | 6.3% |
| 8 | 198678 | 6.3% |
| 1 | 198671 | 6.3% |
| a | 198460 | 6.3% |
| d | 198441 | 6.3% |
| b | 198367 | 6.3% |
| 0 | 198366 | 6.3% |
| 5 | 198350 | 6.3% |
| 2 | 198265 | 6.3% |
| e | 198253 | 6.3% |
| Other values (6) | 1187042 |
customer_city
Categorical
| Distinct | 4116 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| sao paulo | |
|---|---|
| rio de janeiro | 6844 |
| belo horizonte | 2761 |
| brasilia | 2125 |
| curitiba | 1515 |
| Other values (4111) |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 10.342945 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1025110 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1144 ? |
|---|---|
| Unique (%) | 1.2% |
Sample
| 1st row | sao paulo |
|---|---|
| 2nd row | barreiras |
| 3rd row | vianopolis |
| 4th row | sao goncalo do amarante |
| 5th row | santo andre |
Common Values
| Value | Count | Frequency (%) |
| sao paulo | 15504 | 15.6% |
| rio de janeiro | 6844 | 6.9% |
| belo horizonte | 2761 | 2.8% |
| brasilia | 2125 | 2.1% |
| curitiba | 1515 | 1.5% |
| campinas | 1437 | 1.4% |
| porto alegre | 1372 | 1.4% |
| salvador | 1245 | 1.3% |
| guarulhos | 1188 | 1.2% |
| sao bernardo do campo | 935 | 0.9% |
| Other values (4106) | 64186 |
Length
| Value | Count | Frequency (%) |
| sao | 20990 | 12.1% |
| paulo | 15570 | 9.0% |
| de | 9633 | 5.5% |
| rio | 8237 | 4.7% |
| janeiro | 6844 | 3.9% |
| do | 4260 | 2.5% |
| belo | 2820 | 1.6% |
| horizonte | 2786 | 1.6% |
| brasilia | 2134 | 1.2% |
| porto | 1641 | 0.9% |
| Other values (3282) | 98785 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 169060 | |
| o | 126099 | |
| i | 78475 | 7.7% |
| r | 76234 | 7.4% |
| 74588 | 7.3% | |
| e | 66777 | 6.5% |
| s | 62696 | 6.1% |
| n | 45532 | 4.4% |
| u | 44786 | 4.4% |
| l | 44678 | 4.4% |
| Other values (21) | 236185 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 950063 | |
| Space Separator | 74588 | 7.3% |
| Dash Punctuation | 231 | < 0.1% |
| Other Punctuation | 226 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 169060 | |
| o | 126099 | |
| i | 78475 | 8.3% |
| r | 76234 | 8.0% |
| e | 66777 | 7.0% |
| s | 62696 | 6.6% |
| n | 45532 | 4.8% |
| u | 44786 | 4.7% |
| l | 44678 | 4.7% |
| p | 37012 | 3.9% |
| Other values (16) | 198714 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 4 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 74588 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 231 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 226 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 950063 | |
| Common | 75047 | 7.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 169060 | |
| o | 126099 | |
| i | 78475 | 8.3% |
| r | 76234 | 8.0% |
| e | 66777 | 7.0% |
| s | 62696 | 6.6% |
| n | 45532 | 4.8% |
| u | 44786 | 4.7% |
| l | 44678 | 4.7% |
| p | 37012 | 3.9% |
| Other values (16) | 198714 |
Common
| Value | Count | Frequency (%) |
| 74588 | ||
| - | 231 | 0.3% |
| ' | 226 | 0.3% |
| 1 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1025110 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 169060 | |
| o | 126099 | |
| i | 78475 | 7.7% |
| r | 76234 | 7.4% |
| 74588 | 7.3% | |
| e | 66777 | 6.5% |
| s | 62696 | 6.1% |
| n | 45532 | 4.4% |
| u | 44786 | 4.4% |
| l | 44678 | 4.4% |
| Other values (21) | 236185 |
customer_state
Categorical
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| SP | |
|---|---|
| RJ | |
| MG | |
| RS | |
| PR | |
| Other values (22) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 198224 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SP |
|---|---|
| 2nd row | BA |
| 3rd row | GO |
| 4th row | RN |
| 5th row | SP |
Common Values
| Value | Count | Frequency (%) |
| SP | 41631 | |
| RJ | 12796 | 12.9% |
| MG | 11595 | 11.7% |
| RS | 5441 | 5.5% |
| PR | 5025 | 5.1% |
| SC | 3626 | 3.7% |
| BA | 3376 | 3.4% |
| DF | 2134 | 2.2% |
| ES | 2029 | 2.0% |
| GO | 2011 | 2.0% |
| Other values (17) | 9448 | 9.5% |
Length
| Value | Count | Frequency (%) |
| sp | 41631 | |
| rj | 12796 | 12.9% |
| mg | 11595 | 11.7% |
| rs | 5441 | 5.5% |
| pr | 5025 | 5.1% |
| sc | 3626 | 3.7% |
| ba | 3376 | 3.4% |
| df | 2134 | 2.2% |
| es | 2029 | 2.0% |
| go | 2011 | 2.0% |
| Other values (17) | 9448 | 9.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 53789 | |
| P | 50369 | |
| R | 24084 | |
| M | 14105 | 7.1% |
| G | 13606 | 6.9% |
| J | 12796 | 6.5% |
| A | 5798 | 2.9% |
| E | 5349 | 2.7% |
| C | 5035 | 2.5% |
| B | 3911 | 2.0% |
| Other values (7) | 9382 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 198224 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 53789 | |
| P | 50369 | |
| R | 24084 | |
| M | 14105 | 7.1% |
| G | 13606 | 6.9% |
| J | 12796 | 6.5% |
| A | 5798 | 2.9% |
| E | 5349 | 2.7% |
| C | 5035 | 2.5% |
| B | 3911 | 2.0% |
| Other values (7) | 9382 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 198224 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 53789 | |
| P | 50369 | |
| R | 24084 | |
| M | 14105 | 7.1% |
| G | 13606 | 6.9% |
| J | 12796 | 6.5% |
| A | 5798 | 2.9% |
| E | 5349 | 2.7% |
| C | 5035 | 2.5% |
| B | 3911 | 2.0% |
| Other values (7) | 9382 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 198224 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 53789 | |
| P | 50369 | |
| R | 24084 | |
| M | 14105 | 7.1% |
| G | 13606 | 6.9% |
| J | 12796 | 6.5% |
| A | 5798 | 2.9% |
| E | 5349 | 2.7% |
| C | 5035 | 2.5% |
| B | 3911 | 2.0% |
| Other values (7) | 9382 | 4.7% |
product_description_lenght
Real number (ℝ)
| Distinct | 2952 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 758 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 782.43398 |
| Minimum | 0 |
|---|---|
| Maximum | 3992 |
| Zeros | 1418 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 138 |
| Q1 | 341 |
| median | 600 |
| Q3 | 986 |
| 95-th percentile | 2120 |
| Maximum | 3992 |
| Range | 3992 |
| Interquartile range (IQR) | 645 |
Descriptive statistics
| Standard deviation | 656.64195 |
|---|---|
| Coefficient of variation (CV) | 0.83922985 |
| Kurtosis | 4.8014892 |
| Mean | 782.43398 |
| Median Absolute Deviation (MAD) | 300 |
| Skewness | 1.9732056 |
| Sum | 76955512 |
| Variance | 431178.66 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1418 | 1.4% |
| 1893 | 583 | 0.6% |
| 492 | 539 | 0.5% |
| 341 | 536 | 0.5% |
| 903 | 479 | 0.5% |
| 245 | 477 | 0.5% |
| 348 | 458 | 0.5% |
| 236 | 428 | 0.4% |
| 366 | 394 | 0.4% |
| 575 | 361 | 0.4% |
| Other values (2942) | 92681 | |
| (Missing) | 758 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 1418 | |
| 4 | 6 | < 0.1% |
| 8 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 20 | 6 | < 0.1% |
| 26 | 2 | < 0.1% |
| 27 | 3 | < 0.1% |
| 28 | 2 | < 0.1% |
| 30 | 7 | < 0.1% |
| 31 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3992 | 2 | < 0.1% |
| 3988 | 1 | < 0.1% |
| 3985 | 3 | |
| 3976 | 3 | |
| 3963 | 1 | < 0.1% |
| 3956 | 2 | < 0.1% |
| 3954 | 2 | < 0.1% |
| 3950 | 1 | < 0.1% |
| 3948 | 1 | < 0.1% |
| 3947 | 6 |
product_photos_qty
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 758 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.217012 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 1418 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.7528737 |
|---|---|
| Coefficient of variation (CV) | 0.7906469 |
| Kurtosis | 4.4503856 |
| Mean | 2.217012 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.8254914 |
| Sum | 218052 |
| Variance | 3.0725661 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 47922 | |
| 2 | 19079 | 19.2% |
| 3 | 11176 | 11.3% |
| 4 | 7558 | 7.6% |
| 5 | 4968 | 5.0% |
| 6 | 3384 | 3.4% |
| 0 | 1418 | 1.4% |
| 7 | 1402 | 1.4% |
| 8 | 676 | 0.7% |
| 10 | 320 | 0.3% |
| Other values (10) | 451 | 0.5% |
| (Missing) | 758 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 1418 | 1.4% |
| 1 | 47922 | |
| 2 | 19079 | 19.2% |
| 3 | 11176 | 11.3% |
| 4 | 7558 | 7.6% |
| 5 | 4968 | 5.0% |
| 6 | 3384 | 3.4% |
| 7 | 1402 | 1.4% |
| 8 | 676 | 0.7% |
| 9 | 289 | 0.3% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 19 | 2 | < 0.1% |
| 18 | 4 | < 0.1% |
| 17 | 8 | < 0.1% |
| 15 | 12 | < 0.1% |
| 14 | 6 | < 0.1% |
| 13 | 26 | < 0.1% |
| 12 | 44 | < 0.1% |
| 11 | 59 | 0.1% |
| 10 | 320 |
product_weight_g
Real number (ℝ)
| Distinct | 2182 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 774 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2100.4419 |
| Minimum | 0 |
|---|---|
| Maximum | 40425 |
| Zeros | 6 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 125 |
| Q1 | 300 |
| median | 700 |
| Q3 | 1800 |
| 95-th percentile | 9750 |
| Maximum | 40425 |
| Range | 40425 |
| Interquartile range (IQR) | 1500 |
Descriptive statistics
| Standard deviation | 3761.5403 |
|---|---|
| Coefficient of variation (CV) | 1.7908328 |
| Kurtosis | 16.409562 |
| Mean | 2100.4419 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | 3.6098005 |
| Sum | 2.0655326 × 108 |
| Variance | 14149185 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200 | 5908 | 6.0% |
| 150 | 4631 | 4.7% |
| 250 | 3992 | 4.0% |
| 300 | 3717 | 3.8% |
| 400 | 3172 | 3.2% |
| 100 | 3100 | 3.1% |
| 350 | 2819 | 2.8% |
| 500 | 2356 | 2.4% |
| 600 | 2278 | 2.3% |
| 700 | 1743 | 1.8% |
| Other values (2172) | 64622 |
| Value | Count | Frequency (%) |
| 0 | 6 | < 0.1% |
| 2 | 5 | < 0.1% |
| 25 | 3 | < 0.1% |
| 50 | 841 | |
| 53 | 2 | < 0.1% |
| 54 | 1 | < 0.1% |
| 55 | 2 | < 0.1% |
| 58 | 1 | < 0.1% |
| 60 | 8 | < 0.1% |
| 61 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 40425 | 3 | < 0.1% |
| 30000 | 254 | |
| 29800 | 1 | < 0.1% |
| 29750 | 1 | < 0.1% |
| 29700 | 3 | < 0.1% |
| 29600 | 5 | < 0.1% |
| 29500 | 1 | < 0.1% |
| 29250 | 1 | < 0.1% |
| 29150 | 1 | < 0.1% |
| 29100 | 1 | < 0.1% |
product_length_cm
Real number (ℝ)
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 774 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.096321 |
| Minimum | 7 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 18 |
| median | 25 |
| Q3 | 38 |
| 95-th percentile | 61.15 |
| Maximum | 105 |
| Range | 98 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 16.125012 |
|---|---|
| Coefficient of variation (CV) | 0.53578016 |
| Kurtosis | 3.7916737 |
| Mean | 30.096321 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 1.7711629 |
| Sum | 2959612 |
| Variance | 260.016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 15238 | 15.4% |
| 20 | 9112 | 9.2% |
| 30 | 6309 | 6.4% |
| 17 | 5328 | 5.4% |
| 18 | 5153 | 5.2% |
| 19 | 4133 | 4.2% |
| 25 | 4121 | 4.2% |
| 40 | 3557 | 3.6% |
| 22 | 3415 | 3.4% |
| 35 | 2574 | 2.6% |
| Other values (89) | 39398 |
| Value | Count | Frequency (%) |
| 7 | 30 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 4 | < 0.1% |
| 10 | 7 | < 0.1% |
| 11 | 82 | 0.1% |
| 12 | 34 | < 0.1% |
| 13 | 49 | < 0.1% |
| 14 | 119 | 0.1% |
| 15 | 178 | 0.2% |
| 16 | 15238 |
| Value | Count | Frequency (%) |
| 105 | 300 | |
| 104 | 29 | < 0.1% |
| 103 | 35 | < 0.1% |
| 102 | 42 | < 0.1% |
| 101 | 88 | 0.1% |
| 100 | 308 | |
| 99 | 33 | < 0.1% |
| 98 | 42 | < 0.1% |
| 97 | 10 | < 0.1% |
| 96 | 8 | < 0.1% |
product_height_cm
Real number (ℝ)
| Distinct | 102 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 774 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.470215 |
| Minimum | 2 |
|---|---|
| Maximum | 105 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8 |
| median | 13 |
| Q3 | 20 |
| 95-th percentile | 44 |
| Maximum | 105 |
| Range | 103 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 13.306655 |
|---|---|
| Coefficient of variation (CV) | 0.80792234 |
| Kurtosis | 7.4876906 |
| Mean | 16.470215 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 2.2594422 |
| Sum | 1619648 |
| Variance | 177.06706 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 8500 | 8.6% |
| 20 | 5823 | 5.9% |
| 15 | 5652 | 5.7% |
| 12 | 5629 | 5.7% |
| 11 | 5471 | 5.5% |
| 2 | 4432 | 4.5% |
| 4 | 4218 | 4.3% |
| 8 | 4050 | 4.1% |
| 16 | 3977 | 4.0% |
| 5 | 3911 | 3.9% |
| Other values (92) | 46675 |
| Value | Count | Frequency (%) |
| 2 | 4432 | |
| 3 | 2335 | 2.4% |
| 4 | 4218 | |
| 5 | 3911 | |
| 6 | 3012 | 3.0% |
| 7 | 3702 | |
| 8 | 4050 | |
| 9 | 2796 | 2.8% |
| 10 | 8500 | |
| 11 | 5471 |
| Value | Count | Frequency (%) |
| 105 | 109 | |
| 104 | 12 | < 0.1% |
| 103 | 37 | < 0.1% |
| 102 | 7 | < 0.1% |
| 100 | 39 | < 0.1% |
| 99 | 5 | < 0.1% |
| 98 | 3 | < 0.1% |
| 97 | 1 | < 0.1% |
| 96 | 8 | < 0.1% |
| 95 | 21 | < 0.1% |
product_width_cm
Real number (ℝ)
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 774 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.019169 |
| Minimum | 6 |
|---|---|
| Maximum | 118 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 15 |
| median | 20 |
| Q3 | 30 |
| 95-th percentile | 45 |
| Maximum | 118 |
| Range | 112 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.748029 |
|---|---|
| Coefficient of variation (CV) | 0.51035851 |
| Kurtosis | 4.6191154 |
| Mean | 23.019169 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 1.7215798 |
| Sum | 2263659 |
| Variance | 138.01618 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 10436 | 10.5% |
| 11 | 9163 | 9.2% |
| 15 | 7891 | 8.0% |
| 16 | 7355 | 7.4% |
| 30 | 6415 | 6.5% |
| 12 | 4839 | 4.9% |
| 13 | 4676 | 4.7% |
| 14 | 4069 | 4.1% |
| 18 | 3554 | 3.6% |
| 40 | 3374 | 3.4% |
| Other values (84) | 36566 |
| Value | Count | Frequency (%) |
| 6 | 2 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 16 | < 0.1% |
| 9 | 48 | < 0.1% |
| 10 | 67 | 0.1% |
| 11 | 9163 | |
| 12 | 4839 | |
| 13 | 4676 | |
| 14 | 4069 | |
| 15 | 7891 |
| Value | Count | Frequency (%) |
| 118 | 7 | < 0.1% |
| 105 | 14 | < 0.1% |
| 104 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 102 | 2 | < 0.1% |
| 101 | 2 | < 0.1% |
| 100 | 41 | |
| 98 | 1 | < 0.1% |
| 97 | 1 | < 0.1% |
| 95 | 2 | < 0.1% |
product_category_name_english
Categorical
| Distinct | 72 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 758 |
| Missing (%) | 0.8% |
| Memory size | 1.5 MiB |
| bed_bath_table | |
|---|---|
| health_beauty | |
| sports_leisure | |
| computers_accessories | |
| furniture_decor | |
| Other values (67) |
Length
| Max length | 39 |
|---|---|
| Median length | 31 |
| Mean length | 12.772251 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1256202 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | housewares |
|---|---|
| 2nd row | perfumery |
| 3rd row | auto |
| 4th row | pet_shop |
| 5th row | stationery |
Common Values
| Value | Count | Frequency (%) |
| bed_bath_table | 9297 | 9.4% |
| health_beauty | 8759 | 8.8% |
| sports_leisure | 7662 | 7.7% |
| computers_accessories | 6641 | 6.7% |
| furniture_decor | 6307 | 6.4% |
| housewares | 5811 | 5.9% |
| watches_gifts | 5602 | 5.7% |
| telephony | 4179 | 4.2% |
| auto | 3867 | 3.9% |
| toys | 3826 | 3.9% |
| Other values (62) | 36403 |
Length
| Value | Count | Frequency (%) |
| bed_bath_table | 9297 | 9.5% |
| health_beauty | 8759 | 8.9% |
| sports_leisure | 7662 | 7.8% |
| computers_accessories | 6641 | 6.8% |
| furniture_decor | 6307 | 6.4% |
| housewares | 5811 | 5.9% |
| watches_gifts | 5602 | 5.7% |
| telephony | 4179 | 4.2% |
| auto | 3867 | 3.9% |
| toys | 3826 | 3.9% |
| Other values (62) | 36403 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 152199 | |
| s | 119339 | 9.5% |
| t | 110624 | 8.8% |
| o | 93377 | 7.4% |
| a | 85428 | 6.8% |
| r | 84089 | 6.7% |
| _ | 83219 | 6.6% |
| u | 64498 | 5.1% |
| c | 59606 | 4.7% |
| i | 51809 | 4.1% |
| Other values (15) | 352014 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1172727 | |
| Connector Punctuation | 83219 | 6.6% |
| Decimal Number | 256 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 152199 | |
| s | 119339 | 10.2% |
| t | 110624 | 9.4% |
| o | 93377 | 8.0% |
| a | 85428 | 7.3% |
| r | 84089 | 7.2% |
| u | 64498 | 5.5% |
| c | 59606 | 5.1% |
| i | 51809 | 4.4% |
| h | 50185 | 4.3% |
| Other values (13) | 301573 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 83219 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 256 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1172727 | |
| Common | 83475 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 152199 | |
| s | 119339 | 10.2% |
| t | 110624 | 9.4% |
| o | 93377 | 8.0% |
| a | 85428 | 7.3% |
| r | 84089 | 7.2% |
| u | 64498 | 5.5% |
| c | 59606 | 5.1% |
| i | 51809 | 4.4% |
| h | 50185 | 4.3% |
| Other values (13) | 301573 |
Common
| Value | Count | Frequency (%) |
| _ | 83219 | |
| 2 | 256 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1256202 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 152199 | |
| s | 119339 | 9.5% |
| t | 110624 | 8.8% |
| o | 93377 | 7.4% |
| a | 85428 | 6.8% |
| r | 84089 | 6.7% |
| _ | 83219 | 6.6% |
| u | 64498 | 5.1% |
| c | 59606 | 4.7% |
| i | 51809 | 4.1% |
| Other values (15) | 352014 |
| length_comment_title | length_comment_message | payment_installments | payment_value | nb_items | sum_price | sum_freight_value | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | order_status | review_score | payment_type | customer_state | product_category_name_english | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| length_comment_title | 1.000 | 0.315 | 0.004 | 0.034 | 0.026 | 0.028 | 0.053 | 0.031 | 0.006 | -0.011 | -0.032 | -0.003 | -0.020 | 0.022 | 0.080 | 0.028 | 0.013 | 0.034 |
| length_comment_message | 0.315 | 1.000 | 0.044 | 0.068 | 0.082 | 0.062 | 0.070 | -0.008 | -0.005 | 0.037 | 0.013 | 0.022 | 0.015 | 0.046 | 0.205 | 0.008 | 0.021 | 0.020 |
| payment_installments | 0.004 | 0.044 | 1.000 | 0.381 | 0.057 | 0.375 | 0.231 | 0.037 | 0.003 | 0.220 | 0.119 | 0.122 | 0.137 | 0.005 | 0.020 | 0.182 | 0.033 | 0.090 |
| payment_value | 0.034 | 0.068 | 0.381 | 1.000 | 0.221 | 0.990 | 0.566 | 0.193 | 0.008 | 0.519 | 0.268 | 0.347 | 0.275 | 0.012 | 0.014 | 0.006 | 0.015 | 0.101 |
| nb_items | 0.026 | 0.082 | 0.057 | 0.221 | 1.000 | 0.177 | 0.377 | -0.036 | -0.056 | -0.005 | 0.008 | 0.004 | 0.001 | 0.000 | 0.031 | 0.009 | 0.000 | 0.027 |
| sum_price | 0.028 | 0.062 | 0.375 | 0.990 | 0.177 | 1.000 | 0.469 | 0.196 | 0.012 | 0.506 | 0.256 | 0.339 | 0.265 | 0.011 | 0.012 | 0.008 | 0.013 | 0.092 |
| sum_freight_value | 0.053 | 0.070 | 0.231 | 0.566 | 0.377 | 0.469 | 1.000 | 0.100 | -0.010 | 0.419 | 0.273 | 0.272 | 0.262 | 0.003 | 0.015 | 0.000 | 0.030 | 0.054 |
| product_description_lenght | 0.031 | -0.008 | 0.037 | 0.193 | -0.036 | 0.196 | 0.100 | 1.000 | 0.155 | 0.100 | -0.011 | 0.132 | -0.060 | 0.005 | 0.011 | 0.012 | 0.019 | 0.212 |
| product_photos_qty | 0.006 | -0.005 | 0.003 | 0.008 | -0.056 | 0.012 | -0.010 | 0.155 | 1.000 | 0.014 | 0.009 | -0.068 | -0.004 | 0.015 | 0.011 | 0.000 | 0.013 | 0.150 |
| product_weight_g | -0.011 | 0.037 | 0.220 | 0.519 | -0.005 | 0.506 | 0.419 | 0.100 | 0.014 | 1.000 | 0.620 | 0.536 | 0.622 | 0.006 | 0.019 | 0.011 | 0.012 | 0.192 |
| product_length_cm | -0.032 | 0.013 | 0.119 | 0.268 | 0.008 | 0.256 | 0.273 | -0.011 | 0.009 | 0.620 | 1.000 | 0.260 | 0.639 | 0.009 | 0.014 | 0.012 | 0.011 | 0.260 |
| product_height_cm | -0.003 | 0.022 | 0.122 | 0.347 | 0.004 | 0.339 | 0.272 | 0.132 | -0.068 | 0.536 | 0.260 | 1.000 | 0.345 | 0.013 | 0.014 | 0.011 | 0.013 | 0.267 |
| product_width_cm | -0.020 | 0.015 | 0.137 | 0.275 | 0.001 | 0.265 | 0.262 | -0.060 | -0.004 | 0.622 | 0.639 | 0.345 | 1.000 | 0.000 | 0.011 | 0.010 | 0.012 | 0.290 |
| order_status | 0.022 | 0.046 | 0.005 | 0.012 | 0.000 | 0.011 | 0.003 | 0.005 | 0.015 | 0.006 | 0.009 | 0.013 | 0.000 | 1.000 | 0.163 | 0.040 | 0.023 | 0.026 |
| review_score | 0.080 | 0.205 | 0.020 | 0.014 | 0.031 | 0.012 | 0.015 | 0.011 | 0.011 | 0.019 | 0.014 | 0.014 | 0.011 | 0.163 | 1.000 | 0.010 | 0.048 | 0.046 |
| payment_type | 0.028 | 0.008 | 0.182 | 0.006 | 0.009 | 0.008 | 0.000 | 0.012 | 0.000 | 0.011 | 0.012 | 0.011 | 0.010 | 0.040 | 0.010 | 1.000 | 0.026 | 0.036 |
| customer_state | 0.013 | 0.021 | 0.033 | 0.015 | 0.000 | 0.013 | 0.030 | 0.019 | 0.013 | 0.012 | 0.011 | 0.013 | 0.012 | 0.023 | 0.048 | 0.026 | 1.000 | 0.030 |
| product_category_name_english | 0.034 | 0.020 | 0.090 | 0.101 | 0.027 | 0.092 | 0.054 | 0.212 | 0.150 | 0.192 | 0.260 | 0.267 | 0.290 | 0.026 | 0.046 | 0.036 | 0.030 | 1.000 |
| customer_id | order_status | order_purchase_timestamp | order_approved_at | order_delivered_carrier_date | order_delivered_customer_date | order_estimated_delivery_date | review_score | length_comment_title | length_comment_message | review_answer_timestamp | payment_type | payment_installments | payment_value | product_most_frequent | nb_items | sum_price | sum_freight_value | customer_unique_id | customer_city | customer_state | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | product_category_name_english | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 9ef432eb6251297304e76186b10a928d | delivered | 2017-10-02 10:56:33 | 2017-10-02 11:07:15 | 2017-10-04 19:55:00 | 2017-10-10 21:25:13 | 2017-10-18 00:00:00 | 4.0 | 0.0 | 170.0 | 2017-10-12 03:43:48 | credit_card,voucher | 1.0 | 38.71 | 87285b34884572647811a353c7ac498a | 1.0 | 29.99 | 8.72 | 7c396fd4830fd04220f754e42b4e5bff | sao paulo | SP | 268.0 | 4.0 | 500.0 | 19.0 | 8.0 | 13.0 | housewares |
| 1 | b0830fb4747a6c6d20dea0b8c802d7ef | delivered | 2018-07-24 20:41:37 | 2018-07-26 03:24:27 | 2018-07-26 14:31:00 | 2018-08-07 15:27:45 | 2018-08-13 00:00:00 | 4.0 | 16.0 | 20.0 | 2018-08-08 18:37:50 | boleto | 1.0 | 141.46 | 595fac2a385ac33a80bd5114aec74eb8 | 1.0 | 118.70 | 22.76 | af07308b275d755c9edb36a90c618231 | barreiras | BA | 178.0 | 1.0 | 400.0 | 19.0 | 13.0 | 19.0 | perfumery |
| 2 | 41ce2a54c0b03bf3443c3d931a367089 | delivered | 2018-08-08 08:38:49 | 2018-08-08 08:55:23 | 2018-08-08 13:50:00 | 2018-08-17 18:06:29 | 2018-09-04 00:00:00 | 5.0 | 0.0 | 0.0 | 2018-08-22 19:07:58 | credit_card | 3.0 | 179.12 | aa4383b373c6aca5d8797843e5594415 | 1.0 | 159.90 | 19.22 | 3a653a41f6f9fc3d2a113cf8398680e8 | vianopolis | GO | 232.0 | 1.0 | 420.0 | 24.0 | 19.0 | 21.0 | auto |
| 3 | f88197465ea7920adcdbec7375364d82 | delivered | 2017-11-18 19:28:06 | 2017-11-18 19:45:59 | 2017-11-22 13:39:59 | 2017-12-02 00:28:42 | 2017-12-15 00:00:00 | 5.0 | 0.0 | 105.0 | 2017-12-05 19:21:58 | credit_card | 1.0 | 72.20 | d0b61bfb1de832b15ba9d266ca96e5b0 | 1.0 | 45.00 | 27.20 | 7c142cf63193a1473d2e66489a9ae977 | sao goncalo do amarante | RN | 468.0 | 3.0 | 450.0 | 30.0 | 10.0 | 20.0 | pet_shop |
| 4 | 8ab97904e6daea8866dbdbc4fb7aad2c | delivered | 2018-02-13 21:18:39 | 2018-02-13 22:20:29 | 2018-02-14 19:46:34 | 2018-02-16 18:17:02 | 2018-02-26 00:00:00 | 5.0 | 0.0 | 0.0 | 2018-02-18 13:02:51 | credit_card | 1.0 | 28.62 | 65266b2da20d04dbe00c5c2d3bb7859e | 1.0 | 19.90 | 8.72 | 72632f0f9dd73dfee390c9b22eb56dd6 | santo andre | SP | 316.0 | 4.0 | 250.0 | 51.0 | 15.0 | 15.0 | stationery |
| 5 | 503740e9ca751ccdda7ba28e9ab8f608 | delivered | 2017-07-09 21:57:05 | 2017-07-09 22:10:13 | 2017-07-11 14:58:04 | 2017-07-26 10:57:55 | 2017-08-01 00:00:00 | 4.0 | 0.0 | 0.0 | 2017-07-27 22:48:30 | credit_card | 6.0 | 175.26 | 060cb19345d90064d1015407193c233d | 1.0 | 147.90 | 27.36 | 80bb27c7c16e8f973207a5086ab329e2 | congonhinhas | PR | 608.0 | 1.0 | 7150.0 | 65.0 | 10.0 | 65.0 | auto |
| 6 | ed0271e0b7da060a393796590e7b737a | invoiced | 2017-04-11 12:22:08 | 2017-04-13 13:25:17 | NaN | NaN | 2017-05-09 00:00:00 | 2.0 | 0.0 | 36.0 | 2017-05-13 20:25:42 | credit_card | 1.0 | 65.95 | a1804276d9941ac0733cfd409f5206eb | 1.0 | 49.90 | 16.05 | 36edbb3fb164b1f16485364b6fb04c73 | santa rosa | RS | 0.0 | 0.0 | 600.0 | 35.0 | 35.0 | 15.0 | unknown |
| 7 | 9bdf08b4b3b52b5526ff42d37d47f222 | delivered | 2017-05-16 13:10:30 | 2017-05-16 13:22:11 | 2017-05-22 10:07:46 | 2017-05-26 12:55:51 | 2017-06-07 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-05-28 02:59:57 | credit_card | 3.0 | 75.16 | 4520766ec412348b8d4caa5e8a18c464 | 1.0 | 59.99 | 15.17 | 932afa1e708222e5821dac9cd5db4cae | nilopolis | RJ | 956.0 | 1.0 | 50.0 | 16.0 | 16.0 | 17.0 | auto |
| 8 | f54a9f0e6b351c431402b8461ea51999 | delivered | 2017-01-23 18:29:09 | 2017-01-25 02:50:47 | 2017-01-26 14:16:31 | 2017-02-02 14:08:10 | 2017-03-06 00:00:00 | 1.0 | 0.0 | 0.0 | 2017-02-05 01:58:35 | boleto | 1.0 | 35.95 | ac1789e492dcd698c5c10b97a671243a | 1.0 | 19.90 | 16.05 | 39382392765b6dc74812866ee5ee92a7 | faxinalzinho | RS | 432.0 | 2.0 | 300.0 | 35.0 | 35.0 | 15.0 | furniture_decor |
| 9 | 31ad1d1b63eb9962463f764d4e6e0c9d | delivered | 2017-07-29 11:55:02 | 2017-07-29 12:05:32 | 2017-08-10 19:45:24 | 2017-08-16 17:14:30 | 2017-08-23 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-08-18 01:47:32 | credit_card,voucher | 1.0 | 169.76 | 9a78fb9862b10749a117f7fc3c31f051 | 1.0 | 149.99 | 19.77 | 299905e3934e9e181bfb2e164dd4b4f8 | sorocaba | SP | 527.0 | 1.0 | 9750.0 | 42.0 | 41.0 | 42.0 | office_furniture |
| customer_id | order_status | order_purchase_timestamp | order_approved_at | order_delivered_carrier_date | order_delivered_customer_date | order_estimated_delivery_date | review_score | length_comment_title | length_comment_message | review_answer_timestamp | payment_type | payment_installments | payment_value | product_most_frequent | nb_items | sum_price | sum_freight_value | customer_unique_id | customer_city | customer_state | product_description_lenght | product_photos_qty | product_weight_g | product_length_cm | product_height_cm | product_width_cm | product_category_name_english | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99431 | 8e1ec396e317ff4c82a03ce16a0c3eb3 | delivered | 2017-10-27 15:21:00 | 2017-10-27 15:32:49 | 2017-10-30 15:44:34 | 2017-11-10 17:57:22 | 2017-11-22 00:00:00 | 5.0 | 0.0 | 77.0 | 2017-11-15 09:54:14 | credit_card | 3.0 | 164.30 | 595fac2a385ac33a80bd5114aec74eb8 | 1.0 | 142.50 | 21.80 | 1a3b8f1d0782ebedbcf220a96cbc1655 | maceio | AL | 178.0 | 1.0 | 400.0 | 19.0 | 13.0 | 19.0 | perfumery |
| 99432 | a2f7428f0cafbc8e59f20e1444b67315 | delivered | 2017-12-20 09:52:41 | 2017-12-20 10:09:52 | 2017-12-20 20:25:25 | 2018-01-26 15:45:14 | 2018-01-18 00:00:00 | 1.0 | 0.0 | 86.0 | 2018-01-21 02:51:39 | credit_card | 1.0 | 71.04 | 3d2c44374ee42b3003a470f3e937a2ea | 1.0 | 55.90 | 15.14 | a49e8e11e850592fe685ae3c64b40eca | campo do tenente | PR | 372.0 | 2.0 | 300.0 | 16.0 | 6.0 | 12.0 | musical_instruments |
| 99433 | da2124f134f5dfbce9d06f29bdb6c308 | delivered | 2017-10-04 19:57:37 | 2017-10-04 20:07:14 | 2017-10-05 16:52:52 | 2017-10-20 20:25:45 | 2017-11-07 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-10-23 14:48:40 | credit_card,voucher | 2.0 | 106.79 | 49d2e2460386273b195e7e59b43587c3 | 2.0 | 69.01 | 37.78 | c716cf2b5b86fb24257cffe9e7969df8 | cuiaba | MT | 180.0 | 3.0 | 750.0 | 26.0 | 15.0 | 26.0 | toys |
| 99434 | f01a6bfcc730456317e4081fe0c9940e | delivered | 2017-01-27 00:30:03 | 2017-01-27 01:05:25 | 2017-01-30 11:40:16 | 2017-02-07 13:15:25 | 2017-03-17 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-02-11 12:37:36 | credit_card,voucher | 5.0 | 389.43 | 9fc063fd34fed29ccc57b7f8e8d03388 | 1.0 | 370.00 | 19.43 | e03dbdf5e56c96b106d8115ac336f47f | divinopolis | MG | 657.0 | 1.0 | 750.0 | 38.0 | 12.0 | 25.0 | health_beauty |
| 99435 | 47cd45a6ac7b9fb16537df2ccffeb5ac | delivered | 2017-02-23 09:05:12 | 2017-02-23 09:15:11 | 2017-03-01 10:22:52 | 2017-03-06 11:08:08 | 2017-03-22 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-03-11 15:42:41 | credit_card | 3.0 | 155.99 | ea73128566d1b082e5101ce46f8107c7 | 1.0 | 139.90 | 16.09 | 831ce3f1bacbd424fc4e38fbd4d66d29 | sao paulo | SP | 254.0 | 2.0 | 2500.0 | 49.0 | 13.0 | 41.0 | furniture_decor |
| 99436 | 39bd1228ee8140590ac3aca26f2dfe00 | delivered | 2017-03-09 09:54:05 | 2017-03-09 09:54:05 | 2017-03-10 11:18:03 | 2017-03-17 15:08:01 | 2017-03-28 00:00:00 | 5.0 | 0.0 | 0.0 | 2017-03-23 11:02:08 | credit_card | 3.0 | 85.08 | ac35486adb7b02598c182c2ff2e05254 | 1.0 | 72.00 | 13.08 | 6359f309b166b0196dbf7ad2ac62bb5a | sao jose dos campos | SP | 1517.0 | 1.0 | 1175.0 | 22.0 | 13.0 | 18.0 | health_beauty |
| 99437 | 1fca14ff2861355f6e5f14306ff977a7 | delivered | 2018-02-06 12:58:58 | 2018-02-06 13:10:37 | 2018-02-07 23:22:42 | 2018-02-28 17:37:56 | 2018-03-02 00:00:00 | 4.0 | 0.0 | 44.0 | 2018-03-02 17:50:01 | credit_card | 3.0 | 195.00 | f1d4ce8c6dd66c47bbaa8c6781c2a923 | 1.0 | 174.90 | 20.10 | da62f9e57a76d978d02ab5362c509660 | praia grande | SP | 828.0 | 4.0 | 4950.0 | 40.0 | 10.0 | 40.0 | baby |
| 99438 | 1aa71eb042121263aafbe80c1b562c9c | delivered | 2017-08-27 14:46:43 | 2017-08-27 15:04:16 | 2017-08-28 20:52:26 | 2017-09-21 11:24:17 | 2017-09-27 00:00:00 | 5.0 | 0.0 | 28.0 | 2017-09-22 23:10:57 | credit_card | 5.0 | 271.01 | b80910977a37536adeddd63663f916ad | 1.0 | 205.99 | 65.02 | 737520a9aad80b3fbbdad19b66b37b30 | nova vicosa | BA | 500.0 | 2.0 | 13300.0 | 32.0 | 90.0 | 22.0 | home_appliances_2 |
| 99439 | b331b74b18dc79bcdf6532d51e1637c1 | delivered | 2018-01-08 21:28:27 | 2018-01-08 21:36:21 | 2018-01-12 15:35:03 | 2018-01-25 23:32:54 | 2018-02-15 00:00:00 | 2.0 | 0.0 | 53.0 | 2018-01-27 09:16:56 | credit_card | 4.0 | 441.16 | d1c427060a0f73f6b889a5c7c61f2ac4 | 2.0 | 359.98 | 81.18 | 5097a5312c8b157bb7be58ae360ef43c | japuiba | RJ | 1893.0 | 1.0 | 6550.0 | 20.0 | 20.0 | 20.0 | computers_accessories |
| 99440 | edb027a75a1449115f6b43211ae02a24 | delivered | 2018-03-08 20:57:30 | 2018-03-09 11:20:28 | 2018-03-09 22:11:59 | 2018-03-16 13:08:30 | 2018-04-03 00:00:00 | 5.0 | 0.0 | 0.0 | 2018-03-17 16:33:31 | debit_card | 1.0 | 86.86 | 006619bbed68b000c8ba3f8725d5409e | 1.0 | 68.50 | 18.36 | 60350aa974b26ff12caad89e55993bd6 | lapa | PR | 569.0 | 1.0 | 150.0 | 16.0 | 7.0 | 15.0 | health_beauty |